Large Scale Production of Syntactic Annotations to Move Forward
نویسندگان
چکیده
This article presents the methodology of the PASSAGE project, aiming at syntactically annotating large corpora by composing annotations. It introduces the annotation format and the syntactic annotation specifications. It describes an important component of the methodolgy, namely an WEB-based evaluation service, deployed in the context of the first PASSAGE parser evaluation campaign.
منابع مشابه
Anr Mdca Proposal Passage Produire Des Annotations Syntaxiques À Grande Échelle Pour Aller De L'avant Large Scale Production of Syntactic Annotations to Move Forward
متن کامل
Large Scale Syntactic Annotation of Written Dutch: Lassy
The construction of a 500-million-word reference corpus of written Dutch has been identified as one of the priorities in the STEVIN programme. The focus is on written language in order to complement the Spoken Dutch Corpus (CGN) [13], completed in 2003. In D-COI (a pilot project funded by STEVIN), a 50-million-word pilot corpus has been compiled, parts of which were enriched with verified synta...
متن کاملThe Impact of Different Frequency Patterns on the Syntactic Production of a 6-year-old EFL Home Learner: A Case Study
This longitudinal study investigated the impact of different Frequency Patterns (FP) on the syntactic production of a six-year-old EFL learner in a home context. Target syntactic constructions were presented using games and plays and were traced for their occurrence patterns in input and output. Following each instruction period, the constructions were measured through immediate and delayed ora...
متن کاملUse of Syntactic and Semantic Filters for Lexical Acquisition: Using WordNet to Increase Precision
This paper describes an approach to automatic extraction of verb meanings from machine-readable resources for the construction of large-scale knowledge sources. We describe semantic lters designed to reduce the number of incorrect assignments made by a purely syntactic technique. We report on our results of disambiguating the verbs in the semantic lters by adding WordNet sense annotations. 1 We...
متن کاملAn annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008